The BAS Speech Data Repository

نویسندگان

  • Uwe D. Reichel
  • Florian Schiel
  • Thomas Kisler
  • Christoph Draxler
  • Nina Pörner
چکیده

The BAS CLARIN speech data repository is introduced. At the current state it comprises 31 pre-dominantly German corpora of spoken language. It is compliant to the CLARIN-D as well as the OLAC requirements. This enables its embedding into several infrastructures. We give an overview over its structure, its implementation as well as the corpora it contains.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

New resources at BAS: acoustic, multimodal, linguistic

Four new speech corpora have been added to the catalogue and will be briefly described. The BAS pronunciation dictionary PHONOLEX was extended by two new types of entries: empirically detected pronunciation variants and empirically collected word entries. Details will be given about the new Verbmobil II speech resources soon available via BAS and the European SpeechDat Car data collection. Apar...

متن کامل

The bavarian archive for speech signals: resources for the speech community

This paper gives an overview of the activities at the Bavarian Archive of Speech Signals (BAS) that was founded as a non-pro t organization in 1995. The main purpose of BAS is the development of a Complete Phonetic Theory (CPT) of German based on the empirical exploitation of very large databases of spoken German. However, on our way to that goal BAS will act as a focal point for all computer r...

متن کامل

Bavarian Archive for Speech Signals ( Bas ) Status Report 1995 - 2000

Outline of this Report The Bavarian Archive for Speech Signals (BAS) is a joint initiative of the Bavarian State and the Ludwig Maximilians Universität München. It is located at the host organisation Institut für Phonetik und Sprachliche Kommunikation and collects, evaluates, produces and disseminates speech based resources to the scientific community. Our focus is the German language covering ...

متن کامل

Speech and Speech Related Resources at BAS

The Bavarian Archive for Speech Signals BAS located at the Ludwig Maximilians Universit at M unchen Ger many collects evaluates produces and disseminates Ger man speech resources to the scienti c community Our focus is the German language covering a large geographi cal part of central Europe Speech and speech related resources are usually produced for certain tasks or projects Therefore it is n...

متن کامل

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus

In this paper we outline the German speech data collection for the SmartWeb project, which is funded by the German Ministry of Science and Education. We focus on the SmartWeb Handheld Corpus (SHC), which has been collected by the Bavarian Archive for Speech Signals (BAS) at the Phonetic Institute (IPSK) of Munich University. Signals of SHC are being recorded in real-life environments (indoor an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016